AMENDMENT TO THE CLAIMS 



1. (Currently Amended) A speech processing system adapted 
to receive an input related to one of speech and text and 
process the input to provide an output related to one of 
speech and text, the speech processing system accessing a 
module derived from a phone set having a plurality of 
phones for a tonal language, the phones being used to model 
syllables used in the module, the syllables having an 
initial and final part, wherein at l east some of the 
sy llables of the tonal lan guage include a glide, the glide 
being embodied in the initial , and wherein the final part 
comprises a plurality of phones that jointly and implicitly 
carry the tonal information. 

2. {Original) The speech processing system of claim 1 
wherein each phone of the final part includes information 
about the tone* 

3- (Original) The speech processing system of claim 1 
wherein the tonal language comprises a plurality of 
different tones with different levels of pitch. 

4. (Original) The speech processing system of claim 3 
wherein the different levels of pitch comprise two 
categorical levels, and wherein each phone has a 
categorical level associated with it, 

5. (Original) The speech processing system of claim 3 
wherein the different levels of pitch comprise three 
categorical levels, and wherein each phone has a 
categorical level associated with it. 



6. (Original) The speech processing system of claim 3 
wherein the different levels of pitch comprise five 
categorical levels, and wherein each phone has a 
categorical level associated with it. 

7. (Original) The speech processing system of claim 1 
wherein each syllable comprises the same form having the 
initial and the final, the final having two phones carrying 
partial tonal information each, 

8. (Currently Amended) The speech processing system of 
claim 1 wherein each syllable is represented by three parts 
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comprising a first part hav ing information p ertaining to 
the in itial part, a se cond part having info rmation 
pertaini ng to a first phon e of the final part and a third 
part h aving informati on pertaining to a first phon e of the 
final part — at — least — some — e4 — fehe — syllables — e£ — the — tonal 

language — include — a---gl--i-de 7 — the — gjrir d c being cmb o d - i - ed - in--4rhe 

initia l. 

9. (Original) The speech processing system of claim 1 
wherein the speech processing system comprises one of a 
speech recognition system and a text-to-speech converter. 

10. (Original) The speech processing system of claim 9 
wherein each phone of the final part includes partial 
information about the tone. 

11. (Original) The speech processing system of claim 9 
wherein the tonal language comprises a plurality' of 
different tones with different levels of pitch. 



12. (Original) The speech processing system of claim 11 
wherein the different levels of pitch comprise two 
categorical levels, and wherein each phone has a 
categorical level associated with it. 

13. (Original) The speech processing system of claim 11 
wherein the different levels of pitch comprise three 
categorical levels, and wherein each phone has a 
categorical level associated with it. 

14. (Original) The speech processing system of claim 11 
wherein the different levels of pitch comprise five 
categorical levels, and wherein each phone has a 
categorical level associated with it. 

15. (Original) The speech processing system of claim 9 
wherein each syllable comprises the same form having the 
initial and the final, the final having two phones carrying 
partial tonal information, 

16. (Cancelled) 

17. (Original) The speech processing system of claim 16 
wherein the tonal language comprises Chinese or a dialect 
thereof, such as Cantonese. 

18. (Original) The speech processing system of claim 16 
wherein the tonal language comprises Thai or a tonal 
dialect thereof. 



19. (Original) The speech processing system of claim 16 
wherein the tonal language comprises Vietnamese or a tonal 
dialect thereof. 

20. (Original) A speech processing system adapted to 
receive an input related to one of speech and text and 
process the input to perform one of speech recognition and 
text-to-speech conversion in order to provide an output 
related to one of speech and text, the speech processing 
system accessing a module derived from a phone set having a 
plurality of phones for a tonal language comprising a 
plurality of different tones with different levels of 
pitch, the phones being used to model syllables used in the 
module, at least some of the syllables having an initial 
and final part, wherein a first set of the plurality of 
phones are used to describe glide dependent initials, and a 
second set of the plurality of phones are used to describe 
the final part, wherein the final part comprises a 
plurality of phones, each phone including partial tonal 
information . 

21. (Original) The speech processing system of claim 20 
wherein the different levels of pitch comprise two 
categorical levels, and wherein each phone has a 
categorical level associated with it. 

22. (Original) The speech processing system of claim 20 
wherein the different levels of pitch comprise three 
categorical levels, and wherein each phone has a 
categorical level associated with it. 



23. (Original) The speech processing system of claim 20 
wherein the different levels of pitch comprise five 
categorical levels', and wherein each phone has a 
categorical level associated with it. 

24. (Original) The speech processing system of claim 20 
wherein at least one syllable comprises only the final 
having two phones carrying partial tonal information each, 

25. (Original) The speech processing system of claim 20 
wherein each syllable comprises the same form having the 
initial and the final, the final having two phones carrying 
partial tonal information each. 

26. (Cancelled) 

27. (Cancelled) 

28. (Cancelled) 

29. (Currently Amended) A i aefetee - d r-co m pu t er readab le storag e 
media having instructions, which when implemen ted on a 
computing device perform e^speech processing comprising: 

accessing a module having a phone set comprising a 
plurality of phones for a tonal language, the 
phones being used to model syllables, the 
syllables having an initial and final part, 
wherein at least some of the syllables of the 
tonal language includ e a g lide; the glide being 
embodied in the initial , and wherein the final 



part comprises a plurality of phones that jointly 
and implicitly carry the tonal information; 
utilizing the phone set to identify syllables 
corresponding to the input for performing one of 
speech recognition and text-to-speech conversion; 
and 

providing an output corresponding to one of speech 
recognition and text-to-speech conversion. 

30. (New) The computer readable storage media of claim 29 
wherein each phone of the final part includes information 
about the tone. 

31. (New) The computer readable storage media of claim 30 
wherein each syllable is represented by three parts 
comprising a first part having information pertaining to 
the initial part, a second part having information 
pertaining to a first phone of the final part and a third 
part having information pertaining to a first phone of the 
final part. 

32. (New) The computer readable storage media of claim 31 
wherein each syllable comprises the same form having the 
initial and the final, the final having two phones carrying 
partial tonal information each. 

33* (New) The computer readable storage media of claim 32 
wherein the different levels of pitch comprise five 
categorical levels, and wherein each phone has a 
categorical level associated with it. 



